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Abstract 

The QXOR-SAT problem is the quantified version of the satisfiabihty problem XOR-SAT 
in which the connective exclusive-or is used instead of the usual or. We study the phase 
transition associated with random QXOR-SAT instances. We give a description of this 
phase transition in the case of one alternation of quantifiers, thus performing an advanced 
practical and theoretical study on the phase transition of a quantified problem. 



1. Introduction 

The last decade has seen a growth of interest in the phase transition for Boolean satisfiability 
(SAT). The clausal version of this problem shows a sharp transition in the sense that, 
as the density of clauses is increased, formulas abruptly change from being satisfiablc to 
being unsatisfiable at a critical threshold point. Numerous experimental studies have been 
performed in the investigation of phase transition for different variants of SAT problems, 
thus giving strong evidence that the location of the transition coincides with such instances 
that are hard to solve. In the meantime, theoretical studies have been conducted in order to 
better understand such transitions. Determining the nature of the phase transition (sharp 
or coarse)^, locating it, determining a precise scaling window and better understanding 
the structure of the space of solutions turn out to be very challenging tasks, which have 
aroused a lot of interest in different communities, namely mathematics, computer science 
and statistical physics (see e.g., Dubois, Monasson, Selman, &; Zecchina, 2001). From a 
computer science point of view, the success of the SAT problem is due to two features. The 
problem SAT provides a framework in which problems within the complexity class NP can 

1. Definitions of sharp and coarse phase transitions can be found on p. 21 in (Janson, Luczak, & Rucinski, 
2000). 
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be adequately expressed, and moreover practically efficient and highly optimized solvers are 
available. 

In order to obtain even stronger systems, many researchers have turned to a powerful 
generalization of Boolean satisfiability, QSAT, where both universal and existential quan- 
tifiers over Boolean variables are permitted. The QSAT problem permits the adequate 
representation and modeling of problems having higher complexity — within the complexity 
class PSPACE — and coming from various fields of computer science such as knowledge rep- 
resentation, verification and logic. Recently, random instances of these quantified problems 
have started to attract some attention (see Gent & Walsh, 1999; Chen &; Interian, 2005). 
Models for generating random instances have been developed, and experimental studies 
have shown that in these models the QSAT property undergoes a phase transition that is 
qualitatively similar to the one that appears for the ordinary SAT property. As stated by 
Chen and Interian (2005), the hope is that research on developing competitive solvers for 
quantified Boolean formulas could benefit from a better understanding of the typical be- 
havior of random instances. Our study follows the pioneering work from Chen and Interian 
(2005) who have made precise a promising model for generating random instances of the 
QSAT problem. We use their model and apply it to the satisfiability problem QXOR-SAT 
that we present below. 

The difficulty of identifying transition factors and of performing theoretical explorations 
of the SAT transition has incited many researchers to turn to a variant of the SAT problem: 
the e-XOR-SAT problem. This satisfiability problem deals with Boolean formulas in con- 
junctive normal form with e variables per clause, in which the "usual or" is replaced by the 
"exclusive or" (we call clauses with "exclusive or" as the only connective XOR-clauses). This 
problem has contributed to develop or validate techniques, thus revealing typical behaviors 
of both random instances and their space of solutions for SAT- type problems (see, e.g., 
Cocco, Dubois, Mandler, & Monasson, 2003; Creignou & Daude, 2003; Dubois, Boufkhad, 
& Mandler, 2000; Dubois k, Mandler, 2002; Pranz, Leone, Ricci-Tersenghi, & Zecchina, 
2001). Therefore, in order to understand how phase transitions evolve for satisfiability 
when introducing quantified variables, it is quite natural to consider this problem. 

Although the phase transition of random k-SAT is not yet well understood, generaliza- 
tion to the quantified version has started for some years. The hope is that a generalization 
of the problem can help in understanding the original one. Another way of gaining insight 
into a hard problem is to look at some tractable variants. For these reasons in this paper 
we embark on a theoretical study of the phase transition for the QXOR-SAT problem, which 
is the quantified version of the XOR-SAT problem. Let us emphasize that this quantified 
problem is, as the usual XOR-SAT problem, in P, and hence will permit us to provide exper- 
iments at a large scale, thus giving useful intuition on the asymptotical behavior of random 
instances. In order to efficiently solve an instance of the XOR-SAT problem, the clause set 
is rewritten to a set of equations with coefficients from the finite field GF{2) and Gaussian 
elimination is performed on the resulting set of equations. Gaussian elimination followed by 
an examination of the quantifier structure provides an algorithm for the quantified version 
of the XOR-SAT problem (for details, see Creignou, Khanna, & Sudan, 2001, chap. 6.4). 

Following the previous studies conducted by Gent and Walsh (1999) as well as Chen 
and Interian (2005), we focus on formulas in conjunctive normal form having two quantifier 
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blocks, namely on formulas of the type ^XJY'^{X, Y), where X and Y denote distinct sets of 
variables, and (/^(X, Y) is a conjunction of XOR-clauses. Moreover, any variable occurring in 
</?(X, Y) occurs in X or Y , i.e., the formula \IX^Y^p{X, Y) is closed. The model has several 
parameters. First we consider (a,e)-QXOR-formulas, which are such that each clause in tp 
has exactly (a + e) variables, o from X and e from Y. The (a,e)-QXOR-SAT property is the 
property for such a formula to be true. The second parameter is a pair (m, n) specifying the 
number of variables in each quantifier block, i.e., in X and Y . The third parameter is L, 
the number of clauses. To sum up, the generated formulas are of the form \fX^Yip{X,Y), 
where X has m variables, Y has n variables, each clause in has a variables from X and e 
from Y and there is a total number of L clauses in ip. We are interested in the probability 
that a formula drawn at random uniformly out of this set of formulas is true as n tends to 
infinity (Section 2). We prove that the nature of the phase transition (coarse or sharp) for 
(a,e)-QXOR-SAT is governed by the number of existential variables occurring in each clause. 
For e = 2 and any a > 1, we prove in Section 3 that the (a,2)-QX0R-SAT has a coarse phase 
transition. Moreover we give an expression of the distribution function of the threshold for 
(a,2)-QX0R-SAT and we show how it is influenced by the different parameters of the model. 
For e > 3, we prove in Section 4 that (a,e)-QXOR-SAT has a sharp phase transition — thus 
getting the first proof of a sharp threshold for a natural quantified satisfiability problem. 

2. QXOR, XOR and the Maximal rank Property 

In this section, we relate QXOR and XOR to the Maximal rank property. We start with some 
definitions and notations. 

2.1 Notation 

An e-XOR-clause, C, is a linear equation over the finite field GF{2) using exactly e distinct 

variables, C = {{xi © ... © Xg) = e) where e = or 1. 

An e-XOR-formula, ip, is a conjunction of not necessarily distinct e-XOR-clauses. A 
truth assignment J is a mapping that assigns or 1 to each variable in its domain, it satisfies 

e 

an XOR-clause C = {{xi © ... © Xe) = e) if and only if I{C) := I{xi) mod 2 = e and it 

i=l 

satisfies a formula p if and only if it satisfies every clause in p. 

We will denote by e-XOR-SAT the property for an e-XOR-formula of being satisfiable. 
An (a,e)-QXOR-/orTOMZa is a closed quantified formula of the following type 

\/X3Yp{XX), 

where X and Y denote distinct set of variables, p{X, Y) is an (a -|- e)-XOR-formula such 
that each clause contains exactly a variables from X and exactly e variables from Y. Such 
a formula is true if, for every assignment to the variables X, there exists an assignment to 
the variables Y , such that p is true. Observe that, for closed formulas, the notions of truth 
and satisfiability coincide. For this reason, we will use the two notions synonymously in the 
following. We denote by (a,e)-QXOR-SAT the property for an (a,e)-QXOR-formula of being 
true. 

Throughout the paper, we reserve m for the number of universal variables (resp. n for 
the number of existential variables), and {xi, . . . , Xm} (resp. {yi, . . . , j/„}) denotes the set 
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of such variables. Note that there are 




(a,e)-XOR-clauses. We consider random formulas ^X3Yip{X,Y) obtained by choosing uni- 
formly independently and with replacement L clauses from all the possible (a,e)-XOR- 
clauses. Using the terminology of Chen and Intcrian (2005), these formulas correspond 
to (a,e)-QXOR((m,n),L)-formulas. We are interested in estimating the probability that 
a randomly chosen (a,e)-QXOR((m,n),L)-formula is true. We denote this probability by 
Pr(jjj„ £,)((a,e)-QXOR-SAT), or shortly Pr((a,e)-QXOR-SAT) when no confusion can arise. 
When restricted to the non-quantified case, e-XOR-SAT, i.e., when a=0, we omit the first 
component in the subscript, thus discussing Pr„ 2, (e-XOR-SAT), or shortly Pr(e-XOR-SAT). 

We will show that the behavior of the (a,e)-QXOR-SAT property is bounded from 
above and below by two monotone properties, namely the e-XOR-SAT property and the 
Maximal rank property. Experiments will suggest that the right parameter in order to study 
these properties is c, the ratio of the number of clauses over the number of existential vari- 
ables. Moreover, according to the results obtained for e-XOR-SAT by Creignou, Daude, and 
Dubois (2003), we know that the transition occurs when c < 1. Therefore, in the sequel we 
will always suppose without loss of generality that L < n. 

2.2 Upper and Lower Bounds for the QXOR-SAT Property 

Note that a random (a,e)-QXOR((m,n),L)-formula can also be considered as the quantified 
linear system 

VX3y {AX + EY = C) (2) 

with coefficient arithmetic in GF{2), where A (respectively E) is a matrix chosen uniformly 
from the set of Boolean L x m (resp. L x n) matrices with exactly a (respectively e) units 
in each row, and C is a Boolean column vector of dimension L chosen uniformly from the 
set of all such vectors. Moreover, A, E and C are chosen independently. 
Observe that the quantified linear system 

MXIY {AX + EY = C) 

is consistent if and only if 

VX {C-AX) e \m{E), 

where lm(£') represents the image of the linear application whose matrix representation is 
E, that is \m{E) = {EY/ Y G {0, l}"}. Hence the quantified linear system is consistent if 
and only if C G \m{E) and \m{A) C \m{E). Therefore, we get: 

Pr((a,e)-QXOR-SAT) = Pr(yX3Y (^X + £;y = C) is consistent) 
= Pr(C € lm(£;) and \m{A) C lm(£;)). 

Thus, on the one hand 

Pr((a,e)-QXOR-SAT) < Pr(C G \m{E)) = Pr(e-XOR-SAT) 
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holds, and on the other hand, the inequahty 

Pr((a,e)-QXOR-SAT) > Pr(lm(^) = {0,1}^) 

holds. Therefore, if Pr„^i(e-Max-rank) denotes the probability that a random matrix from 
the set of L X n Boolean matrices with e units per row is of maximal rank, then for every 
m > a, all n and all L < n, we get the following inequalities: 

Pr„,L(e-Max-rank) < Pr(^ „ i)((a,e)-QXOR-SAT) < Pr„,i(e-XOR-SAT). (3) 

A natural question at this stage is to estimate the probability that a random matrix is 
of maximal rank. In the following we will provide some experiments and theoretical results 
comparing the behavior of the three properties. Maximal rank, QXOR-SAT and XOR-SAT, 
thus making precise the behavior of the (a,e)-QXOR-SAT property according to the value 
of e. 

3. The Case e = 2 

In this section, we restrict our attention to the case where all problems have two existential 
variables in each clause (and the number of all variables is allowed to vary). 

3.1 Experimental Results 

In order to illustrate the inequalities (3) and to compare empirically the three properties, 
we discuss experiments that we have performed. In the experiments, all formulas are closed. 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



^clauses / #exvars 

Figure 1: The curves for (a,2)-QX0R-SAT, m = n = 10 000 and a varying. 

In all cases, the experiments have been conducted according to the same scheme. Let 
us describe it in detail for Figure 1. One experiment consisted in generating at random (in 
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drawing uniformly and independently) (l,2)-QX0R-formulas over 10 000 existential vari- 
ables and 10 000 universal variables with a ratio "number of clauses/number of existential 
variables" varying from 0.1 to 1 in steps of 0.05 or 0.1. For each of the chosen values of 
ratio, a sample of 1000 formulas were studied using the algorithm described in the work of 
Creignou et al. (2001, chap. 6.4), thus deciding their truth (or satisfiability) as quantified 
formulas. The proportion of true instances for each considered value of ratio has been plot- 
ted in Figure 1. The same has been done for the other (a,2)-QX0R-SAT properties. Hence, 
the different curves are independent from each other. For the 2-XOR-SAT experiment, we 
used the same selection procedure over 10 000 existential variables. Again, Gaussian elim- 
ination together with an examination of the quantifier structure were used to determine 
the logical "status" (true or false) of every formula. Additionally, it has been computed 
whether the matrix E has full rank or not. The curve 2-Max-rank shows the proportion 
of systems with full rank and it corresponds to the 2-XOR-SAT curve in the same figure. 
A close look at Figure 1 reveals that some points from the (a,2)-QX0R-SAT curves are 
slightly below the (theoretical) lower bound given by the curve for 2-Max-rank. The reason 
for this phenomenon is the independence of all the satisfiability curves from each other and 
the "noise" induced by the finite sampling of problems. If we had chosen corresponding 
problems with exactly the same existential part (and only the universal part varies), then 
we would have got all satisfiability curves above the curve for 2-Max-rank. 

The experimental results shown in Figure 1 suggest first that the two bounding proper- 
ties, namely 2-Max-rank and 2-XOR-SAT are distinguishable, second that, when m = n, the 
(a,2)-QX0R-SAT property coincides asymptotically with the 2-Max-rank property indepen- 
dently of a > 1, the number of universal variables per clause. 
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Figure 2: (1,2)-QX0R-SAT when m = 1 and n = 10 000 compared to 2-XOR-SAT and 
2-Max-rank. 
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At another scale for m, for instance when m is constant, the experimental results re- 
ported in Figure 2 suggest that the (a,2)-QX0R-SAT property is in between the two prop- 
erties 2-Max-rank and 2-XOR-SAT. 

In the following, we will validate the conjectures suggested by these experiments and 
prove that the property (a,2)-QX0R-SAT coincides asymptotically with the 2-Max-rank 
property as soon as m is tending to infinity with n, and that it is in between the 2-Max-rank 
property and the 2-XOR property when m is a fixed constant. In particular, we will make 
clear the connection between random (a,2)-QXOR-formulas and random labelled graphs. 

3.2 Bad Cycles and the (a,2)-QX0R-SAT Property 

We are interested in the satisfiability of quantified systems of the form 

s = {yX3Y AX + EY = C), 

where E (respectively A) is a matrix chosen uniformly in the set of Boolean L xn (respec- 
tively Lxm) matrices with exactly 2 (respectively a) units in each row, and C is a random 
column vector of dimension L, in which and 1 occur with the same probability. The satis- 
fiability of such a system is strongly related to the existence of cycles in graphs. Indeed, we 
construct a graph Ga{s) with n vertices and L weighted edges. For each existential variable 
Ui we have a vertex in Ga{s). For each equation 2/i © yj = Xj^ . . . © Xi^ © e, we add the 
edge {ui, Uj} to Ga(s) with the label Xj^ © . . . © Xj^ © e. A cycle is given by a sequence of 
vertices (?/i^, . . . ,yi^) such that for 1 < j < s — 1, {yij,yij_^_i} is an edge in the graph, and 
so is {yis,yii}- The cycle is said to be elementary if all the vertices in the sequence are 
distinct. The weight of a cycle is the sum modulo 2 of the labels of its edges. 

Example 3.1 Let X = {xi, X2, X3} and let Y = {yi, ... , 1/7} . The formula yX3Y ip{X, Y) 
with if{X, Y) being a conjunction of the following equations 



can be represented by the graph in Figure 3. 

In the following, we call a cycle had when it has a nonzero weight, and good otherwise. 

Example 3.2 In the graph associated with the formula described in the previous example, 
there is a good cycle, iyi,y2,y6,y7), whose weight is 0, and a bad one, (2/3,1/4,1/5), whose 
weight is x^- For the latter cycle, the corresponding equations are y^®yA = X2©1, yz®yh = 
X3, and 7/4 © ys = X3 © 1. Adding these three equations yields the equation = X2 which 
cannot be satisfied because X2 £ X is universally quantified. 

For systems containing only existential variables, i.e., a = 0, it has been observed by 
Creignou and Daude (2003) that a 2-XOR- formula is satisfiable if and only if the graph 
Go{s) has no bad cycle, that is : 



yi ©y2 
2/2 ©ya 
ys ©y4 
y4©y5 



X3 

X2 © 1 
X3 © 1 



yi ©y7 
y2 ©ye 
y3©y5 
ye ©yr 



= X2 

= X2 © 1 

= X3 

= Xl © 1 



Pr(2-X0R-SAT) = Pr(Go(s) has no bad cycle). 
Using similar arguments, we get the following proposition. 



(4) 



7 



Creignou, Daude, & Egly 




Figure 3: The graph Ga{s) from Example 3.1 (addition is performed mod2). 

Proposition 3.3 The system s is satisfiable if and only if Ga{s) does not contain any 
elementary bad cycle, i.e., 

Pr((a,2)-QX0R-SAT) = Pr(Ga(s) has no bad cycle). 

Proof: Suppose we have an elementary cycle with nonzero weight in Ga{s). Clearly, to 
such a cycle corresponds a subsystem in s, for which there exists an assignment to X such 
that no assignment to Y will satisfy it (see Example 3.2 for an illustration). Hence, s is 
unsatisfiable. 

Conversely, suppose there is no (elementary) cycle with nonzero weight in Ga{s). Take 
an arbitrary truth assignment / for the (universal) variables in X and apply it to Ga{s). 
Since I{x) G {0, 1} for all x € X, the weight at each edge can be reduced to a constant 
from {0, 1} by addition modulo 2. Moreover, since each cycle in Ga{s) has zero weight, the 
corresponding cycle in the "reduced" version, G'a(s), of Ga{s) has also zero weight. The 
graph G^(s) corresponds to a system with existential quantifiers only. 

In order to obtain a satisfying triith assignment for the existential variables, it suffices to 
apply the following procedure to each connected component of G'^{s). Consider an arbitrary 
root vertex y and assign an arbitrary truth value u to it. We obtain a truth value for each 
vertex in G(j(s) by performing a depth-first search starting from y. During the search, if 
there is an edge (y', y") labelled with s and y" has no truth value yet, then we set the value 
for y" to the value of y' ©e. The assignment obtained in this way satisfies all the equations 
since G'g^{s) does not contain any cycle with nonzero weight. I 

Remark: Observe that 

Pr(2-Max-rank) = Pr(Ga(s) has no cycle) (5) 

holds. 
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3.3 The Distribution Functions for 2-XOR-SAT and 2-Max-rank 

In this section, we will give the exact asymptotical value of the bounds obtained in (3) in 
terms of the order parameter c, where c • n is the number of clauses. For this we will use 
well-known results from random graph theory. 

Let us recall that we consider the classical probabilistic model where each clause/edge 
is chosen uniformly and independently among the 



N 



m\ in 
a J \e 



possible ones. According to Proposition 1.13 in (Janson et al., 2000), if we choose L = c-n 
clauses, then this model is asymptotically equivalent to the one where each clause is drawn 
independently with probability p, where 



holds. So, in the following, we will work with the random labelled graphs Ga{s) associated 
with quantified systems s, with labelled edge probability: 



(a) 

The corresponding probability is usually denoted by jip. However, for simplicity we will 
keep the notation Pr. 

In the light of Proposition 3.3 and of (5), it appears that we have to study 

Pr(Ga(s) has no (bad) cycle). 

The asymptotic behavior of the number of cycles in random graphs has been first investi- 
gated by Erdos and Renyi (1960), and made precise by Janson (1987) and Takacs (1988). 
This number converges in distribution to a Poisson law of parameter A, where A is the limit 
of the average number of cycles as n, the number of vertices, tends to infinity. 

This result can be easily extended to our model of labelled graphs. In particular, 

Pr(Ga(s) has no (bad) cycle) — > cxp(— A), 

where A is the limit of the the average number of (bad) cycles. A challenging task is now 
to get a simple expression of lambda. 

Let Y be the random variable that counts the number of cycles. Let C be the set 
of all possible cycles. For any cycle c, we introduce the random variable Xc such that 
Xc.{Ga{s)) = 1 holds, if and only if Ga{s) contains the cycle c. The average number of 
cycles is 

E{Y) = E{Y,X,)=Y,E{X,). 

ceC ceC 

Since every cycle c of length I has expectation E{Xc) = p' and since the number of cycles 
n{n — 1) . . . {n — I + 1) frri\^ , 



of length I is — • • 2 , we get that 

21 \ a 

l>2 
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holds. Thus, 

lim EiY) = J] M = -i ln(l - 2c) - c 

l>2 

also holds. From (5), we obtain for every < c < ^ that 

lim Pr(„ cn)(2-Max-rank) = exp(- ln(l — 2c) + c) 

n— >+oo \ ' ' 2 

holds, and finally 

lim Pr(„,e„)(2-Max-rank) = i7oo(c) (6) 

is established, where 



exp(c) • (1 - 2c)^/2 for < c < i, 
otherwise. 



The experimental results shown in Figure 4 illustrate this asymptotical behavior. 




^clauses / #exvars 



Figure 4: The curves for the 2- Max- rank property. 

According to (4) 

lirn Pr(„.,„)(2-X0R-SAT) = exp(-Ao) 

holds, where Aq denotes the limit of the average number of bad cycles. In this particular 
case, the weight of a clause is either or 1, which means that half of the cycles are bad. 
Thus, 

A„= ^ "<"-')-J"-' + ^ .('")'.2--.p' = -lln(l-2.)-|, 

l>2 ^ ' 
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Therefore, for every c > 0, we get 

lim Pr(„,,„)(2-X0R-SAT) = exp(i ln(l - 2c) + 

and finally 

liin Pr(„,,„)(2-X0R-SAT) =Fo(c) (7) 

n— >+oo ^ ' 

is established, where 

/exp(c/2) • (1 - 2c)i/4 for < c < i, 
I U otherwise. 

This is illustrated by Figure 5. 



2-XOR-SAT for n=10k - 
2-XOR-SAT for n=20k -e- 
2-XOR-SAT for n=40k - 




0.3 0.4 0.5 0.6 0.7 

^clauses / #exvars 



Figure 5: The curves for the 2-XOR-SAT property. 



3.4 The Distribution Function for (a,2)-QX0R-SAT 

The results obtained in the previous section, (6) and (7), together with the inequalities (3) 
are sufficient to conclude that the (a,2)-QX0R-SAT property has a coarse phase transition 
at the scale L = c - n and that its distribution function is in between the functions Hq and 
Hoo described above. More precisely we get the following theorem. 

Theorem 3.4 For any integer a > 1 and every c > 0, let us consider (a,2)-QX0R((m,n),cn)- 

formulas consisting in the conjunction of c ■ n XOR-clauses, where each clause contains a 
variables from a set of m universal variables, and e variables from a set of n existential 
variables. Then, when n tends to infinity, the (a,2)-QX0R-SAT property has a coarse phase 
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transition whose asymptotical distribution function can be expressed as a function depending 
on m. More precisely, for all c > 0, every a > 1 and m = a 

Pr(a,n,cn)((a,2)-QX0R-SAT) ^n^+oo H{c) 

holds, where 

Hi^c) - ['''^^^^^^^ ~ 2c)V2(i _ 4c2)-V8 forQ<c<\, 
1 otherwise. 

If m is a function of n tending to infinity with n, then, for every a>\, 

P''(^,„,cn)((a.2)-QX0R-SAT) -^„^+oo H^{c) 

holds, where 

Hooic) 



'exp(c) • (1 - 2c)V2 /orO<c<i, 
otherwise. 



Moreover, for every fixed m > a > 1, there exists a distribution function such that 
Pr(„,n,en)((a,2)-QX0R-SAT) ^n-^+oo Hm{c), 

with satisfying 



Hoc < Hm < Ho, 

where 

Ho{c) 



Jexp(c/2)(1 -2c)V4 forO<c<^, 
1 otherwise. 



Proof: The proof is based on Proposition 3.3 and, as discussed in the previous section, on 
an estimation of the number of bad cycles in the labelled graphs associated with random 
formulas. 

Let Xm,a be the limit of the average number of bad cycles. We will give a closed 
expression of Xm,a- Observe that each label on the edges of the graph associated with a 
(a,2)-QX0R((m,n),cn)-formula is formed with a constant, or 1, and a "variable-label" made 
of a universal variables. There are exactly (™) such variable-labels, which are numbered 
from 1 to C^). One can decide whether a cycle is good or bad according to the number 
of 1 (even or odd) and the number of occurrences of each variable-label. Therefore it is 
quite natural to associate to every cycle its length I, the sequence (A'^i, . . . ,N^m-^^) of the 
numbers of occurrences of each variable-label, and the parity e = or 1 of the number of 
occurrences of the constant 1. The limit of the average number of cycles whose parameter 
{I, {Ni , N^m^ ) , e) is fixed is 

21 

Moreover, from such a parameter (I, {Ni, . . . ,N^m-^),e), one can decide whether a cycle is 
bad or not. 
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For a better readability, let us focus on the case a = 1. In this particular case, the label 
of an edge is of the form ©e, where e = or 1 and I < i < m. On the one hand, all cycles 
of odd length are bad (for in the weight of such a cycle at least one of the coefficients of the 
Xi's will be nonzero). On the other hand, there are two kinds of cycles of even length that 
are bad. The ones in which the constant 1 appears an odd number of times, and the ones 
in which one of the universal variables appears an odd number of times. Since, for m > 1, 
we have ^ ^ 

^ \Ni,N2,...,Nm) ^ \Ni,N2,...,Nrr 

Vi Ni=0{2) V ^' ' Si Ni=l{2) V -i' ^' ' " 

we get 

_ ^ (2c)^"+l 1 ^ {2cf^ 1 jlcf- N,^l(2) im,Nl..,Nj . . 

"^'^ 2{2u + 1) 2 (2(2^)) ^ 2 (2(2^)) m' " ^ ^ 

Standard combinatorial computations show that, for even Z, the equation 

Eai Ni=l{2) {Ni, N2,...,NrJ ^ I f ~ ^\ ~ 2^)^ 



k=0 



2m- 1 . jj^l 



holds. Therefore, we rewrite (8) and obtain 



(2c)2"+i ^ (2c)2^ (2c)2'' / /m-l\ (m - 2^)' 



-.1 4l 2(2u + l)^ ^2(2(2^))-^^ 2(2(2.;)) '1^ to V W 2-i ■ ^ 
First, observe that A^^i is a function of c, thus we deduce the last part of the theorem: 
liin Pr(^,,,,„)((l,2)-QX0R-SAT) = exp(-A^,i) = Hm{c). 

n— >+oo V ' ' ^ 

Second, from the above expression of A^^i and using the following inequality 

\ k ) 2"*-i • m'- ~ m' 

we get that 

^l,mJA„,,) = i:M = -iMl-2c)-c 

l>2 

holds, which proves the second statement of the theorem. 
In addition, when m = a = 1, we get the equation 

1 = y i^^l!!!^ + y 4^!^ = -1 ln(l - 2c) + - ln(l - 4c2) - c. 
' V2(2« + l) ^2(2(2u)) 2 ^ ^ 8 ^ ' 

Thus, we have established 
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lim Pr(i,„,,„)((l,2)-QX0R-SAT) = exp(-Ai,i) = H{c), 



where 



H{c) 



exp 



;c)(l-2c)V2(i_4c2)-i/8 forO<c<i 



2' 



otherwise. 



I 

This result is ihustrated by Figure 6, while Figure 7 shows the comparative behavior of 
the three distribution functions Hq, H and H^o. 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 

^clauses / #exvars 

Figure 6: The curves for the (1,2)-QX0R-SAT property with m = I. 



4. The Case e > 3 

It can be observed from the experimental results shown in Figure 8 that, contrary to what 
has been observed in the previous section, the three smooth lines connecting the con- 
secutive points and corresponding to the transition of the three properties 3-Max-rank, 
(a,3)-QX0R-SAT and 3-XOR-SAT are difficult to distinguish. Moreover, when n increases 
(see Figure 9), the curves straighten and come closer one to each other, showing thus strong 
empirical evidence that the transitions of the three properties coincide asymptotically, with 
a sharp phase transition at the critical value C3 ~ 0.918 (which is the critical ratio for 
3-XOR-SAT, see Dubois & Handler, 2002). We will show that, for e > 3, the introduction 
of universal variables in XOR- formulas does not influence the sharp transition. 

Theorem 4.1 For every e > 3 and any integer a, the (a,e)-QXOR-SAT property has a 
sharp threshold which coincides with the one of the e-XOR-SAT property. 
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1^ 

0.9 - 

0.8 - 

0.7 - 

0.6 - 

0.5 - 

0.4 - 

0.3 - 

0.2 - 

0.1 - 





0.8 



H{x) = e"x/]~^(l - 4x2)-i/8 




0.3 0.4 0.5 0.6 0.7 

^clauses / #exvars 



0.8 0.9 



Figure 7: The distribution functions Hq, H and Ho 



#clauses/#exvars — 0.918 — 
3-XOR-SAT for n=lk - 
3-Max-rank for n=lk -- 
(1,3)-QX0R-SAT for m=n - 



0.85 0.9 

#clauses / #exvars 



0.95 




Figure 8: The curves for 3-XOR-SAT, 3-Max-rank and (a,3)-QX0R-SAT for n=1000. 



Proof: Let us recall that 



Pr„,L(e-XOR-SAT) 



Pr(3y {EY = C) is consistent) 
Pr(C G \m{E)) 

Pr(C G V and Im(^) = V), 

yc{o,i}^ 
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\ 




0.2 
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0.1 




(1,3)-QX0R-SAT for m=n 
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1 


1 — 1 


< 



0.8 0.85 0.9 0.95 1 



^clauses / #exvars 

Figure 9: The curves for 3-XOR-SAT, 3-Max-rank and (a,3)-QX0R-SAT for n=2000. 



since E and C are chosen independently. Therefore, if P(r) denotes the probability that a 
random matrix from the set of L x n Boolean matrices with e units per row is of rank r, 
then 

^ 1 
Pr,,L(e-XOR-SAT) = ^ 2^-^P(,) < + ^ (^(L-i) + • • • ^'{o))- 

r=0 

Now observe that P(q) -|- . . . -|- P^^^ = 1, thus -|- . . . -P(o)) = 1 ~ -^(i)' ^'^'^ hence 

we get 

1 + P(L) 
Pr„,L(e-XOR-SAT) < 

Therefore, according to (3), we have 

2 • Pr„,L(e-XOR-SAT) - 1 < Pr„,L((a,e)-QXOR-SAT) < Pr„,L(e-XOR-SAT). 

Since we know that the property e-XOR-SAT exhibits a sharp threshold when L is 0(n) 
(Creignou & Daude, 2003), this shows that (a,e)-QXOR-SAT also does. The same holds for 
the property e-Max-rank (since Pr„_i^(e-Max-rank) = P(l))- particular, for e = 3, we have 
shown that (a,e)-QXOR-SAT as well as 3-Max-rank have a sharp threshold with a critical 
value C3 ^ 0.918, which is the critical ratio for 3-XOR-SAT (Dubois & Handler, 2002). I 



5. Conclusion 

We have (experimentally and theoretically) analyzed the phase transition for the quantified 
problems (a,e)-QXOR-SAT. Our analysis has been conducted at the same level of sophisti- 
cation as the one made for the e-XOR-SAT problem, thus showing that the model proposed 
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by Chen and Interian (2005) is mathematically tangible and provides the good parameters 
in order to perform a mathematical analysis of the phase transition for quantified problems. 

On the one hand, as observed for QSAT (Gent k Walsh, 1999; Chen & Interian, 2005), 
we have proved that the nature of the transition is not influenced by the introduction 
of universal variables. On the other hand, in contrast with QSAT, we have proved that 
the location of the phase transition — the critical ratio — is the same for the two properties 
XOR-SAT and QXOR-SAT, and that the difference of behavior between these two properties 
occurs at the level of the distribution function. 
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